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Abstract. Most current studies estimate the invulnerability of complex networks 
using a qualitative method that analyzes the inaccurate decay rate of network efficiency. 
This method results in confusion over the invulnerability of various types of complex 
networks. By normalizing network efficiency and defining a baseline, this paper defines 
the invulnerability index as the integral of the difference between the normalized 
network efficiency curve and the baseline. This quantitative method seeks to establish 
a benchmark for the robustness and fragility of networks and to measure network 
invulnerability under both edge and node attacks. To validate the reliability of 
the proposed method, three small-world networks were selected as test beds. The 
simulation results indicate that the proposed invulnerability index can effectively and 
accurately quantify network resilience. The index should provide a valuable reference 
for determining network invulnerability in future research. 
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1. Introduction 

Recently, the work of the pioneering scientists Albert et al. on network resilience 
has attracted much attention [1]. Their widely accepted results indicate that scale- 
free networks such as the Internet and the World Wide Web exhibit a high robustness 
to random failures, yet they are extremely vulnerable to selective attacks. Albert et 
al. [T] discussed only node attacks, whereas Holme et al. subsequently extended the 
concepts of edge degree and edge betweenness, exploring the notion of invulnerability 
under edge attacks [5J. Furthermore, Motter [3], Holme [I] and other researchers have 
investigated the dynamic effects of networks under node and edge attacks. Song et al. 
have also discussed the fragility of fractal networks[5]. The real- world networks, such 
as metabolic networksp], scientific cooperation networks[8], social networks [9| IT0[ ITT], 
and email networks [12] etc. have also been investigated [6]. The results of these studies 
support the claim of Albert et al. that robustness and fragility can coexist in scale-free 
networks [JJ. However, in most of these studies, network invulnerability was analyzed by 
qualitatively assessing the decay rate of network efficiency. 

Recent studies have shared new perspectives in understanding network resilience. 
Schneider et al. p2] proposed a method for generating scale-free networks with an onion 
structure and defined a robustness measure for estimating the network invulnerability 
under node attacks, by summing the fractions of nodes in the largest connected cluster. 
Then, they reached the conclusion that, although still fragile, such onion-structure 
networks are the most robust scale-free networks against malicious attacks. However, 
this measure is suitable only for estimating network resilience under node attacks and 
does not define a benchmark for distinguishing robustness and fragility. In the future, 
other types of networks should be investigated. In another study, Zheng et al. [U] 
also proposed a method for defining robustness under node attacks, but no quantitative 
calculation method was provided. 

The current paper tries to provide a quantitative method to accurately characterize 
robustness and to clearly distinguish the robustness and fragility of networks. Moreover, 
this method should be able to address both node and edge attacks. To meet these aims, 
we first normalize the network efficiency using the normalized size of the removed edges 
and define a baseline as a benchmark for distinguishing robustness and fragility of the 
network under an attack. Second, to quantify the network invulnerability under an 
attack strategy, i.e., a sequence of attacks, we define the invulnerability index as the 
integral of the normalized network efficiency curve minus the baseline. 

To validate the proposed method, we explore the robustness of small-world 
networks. Two small-world networks from the real world and one WS model network [T5] 
are tested under edge and node attacks. The simulation results demonstrate that the 
proposed method is effective at quantifying network resilience under node and edge 
attacks and at distinguishing network robustness and fragility. 
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2. The quantitative method 

The proposed method aims to 1) address both node and edge attacks and 2) provide 
a clear and quantitative demarcation between robustness and fragility. To achieve the 
first aim, the proposed method suggests normalizing the network efficiency using the 
normalized size of the removed edges. Because node attacks can be regarded as attacks 
on bundles of edges, this method is capable of simultaneously addressing node and edge 
attacks. To provide a clear and quantitative distinction between robustness and fragility 
under an attack, this method defines a continuous curve-the baseline, and then defines 
the invulnerability index as the integral of the normalized network efficiency curve minus 
the baseline, then establishes a benchmark such that if the invulnerability index of a 
network is greater than 0, then the network is robust; otherwise, the network is fragile. 



2.1. Normalized network efficiency 

Previous studies have proposed many methods for measuring network efficiency [121 
|2]. In this paper, we use the size of the giant component of a given graph to determine 
network efficiency after a set of edges has been removed. When dealing with node 
attacks, the removal of nodes is regarded as the removal of bundles of edges. For 
example, the removal of node A can actually be regarded as the removal of all of the 
node's edges. 

Assume that an initial network has N nodes and E edges. After the removal of 
a set of edges R, the size of the giant component is s(R). The normalized network 
efficiency with respect to the normalized size of the removed edges r can be expressed 
as equation ([T]). In this equation, s(r) G [0, 1], and r 6 [0, 1]. 

s (r) = — (1) 

where 

r = — (2) 

E V ' 

Because the number of edges is an integer (i.e., is discrete), any function of 
the normalized network efficiency is also discrete. To define a baseline and compute 
the invulnerability index, the function of the normalized network efficiency should be 
continuous. We use the most common method for creating this function: i.e., linking 
adjacent discrete points one by one to form a network efficiency curve. 

However, one problem associated with node attacks should be mentioned. The 
removal of a node is regarded as the removal of a bundle of edges. Because the sequence 
in which the edges are removed is uncertain, the normalized network efficiency varies. 

To simplify this problem, this paper suggests a method of interpolation. 

Assume that the network efficiencies before and after node removal are s, and Sj, 
respectively; thus, the network efficiency after the removal of m edges is defined by 
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equation fl3]). 

s m = Si + m _ l (sj -s^ (i<m< j) (3) 

Using this method of interpolation, the node attacks are generalized to edge attacks, 
and the network efficiency under node attack approximates the network efficiency under 
edge attack. 



2.2. Definition of baseline 

When a network is randomly or maliciously attacked, a coordinate is often 
established to characterize the relationship between the attack and the network efficiency 
(see Fig. 1). In Fig. 1, the horizontal axis represents the proportion of edges removed 
under each attack, and the vertical axis represents the normalized network efficiency. 
Each attack and the corresponding network efficiency is plotted as a point. After a 
sequence of attacks, the relationship between the attacks and the network efficiency is 
plotted as a discrete curve. 




Figure 1. Measurement of network invulnerability by the integral difference. The 
vertical axis s(r) is the normalized network efficiency, and the horizontal axis r 
represents the fraction of removed edges. The continuous blue diagonal line is the 
baseline; the green dashed curve is the discrete network efficiency curve, and P is a 
point on the discrete curve. When calculating the invulnerability index, the discrete 
network efficiency curve should be continuous. The discrete adjacent points A and 
B are continued as curve AB. The shaded region represents the integral difference 
between the baseline and the network efficiency curve: i.e., the invulnerability index. 

Most current methods define network invulnerability by considering the decay rate 
of network efficiency. In this paper, we set a baseline so as to more easily recognize the 
robustness or the fragility in attack graphs and to mark the critical point at which the 
fragility of a graph emerges. The baseline is shown as the blue diagonal curve in Fig. 1 
and evenly divides the area of the coordinates into two parts. 
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Mathematically, the baseline can be formalized as equation flU). 



f(r) = 1 - s(r) 



(4) 



The baseline is used as a strict demarcation to distinguish the network robustness 
from the fragility in the attack graph. When a point on the network efficiency curve is 
above the baseline, the network is robust to a given attack; otherwise, it is fragile. For 
example, point P in Fig. 1 is above the baseline under a given attack; therefore, the 
network is robust to this attack. Correspondingly, when a sequence of points is above 
the baseline, the network is robust to a given series of attacks. Otherwise, the network 
is fragile. 

However, when some points are above the baseline and some under the baseline, a 
demarcation is needed. Moreover, a more accurate demarcation of the invulnerability of 
the entire network is also required. Thus, this paper uses the integral of the difference 
of the normalized network efficiency curve minus the baseline to quantify the network 
invulnerability (section 2.4). 

2.3. The invulnerability index 

Based on the normalized network efficiency curve and the baseline, this paper 
defines the invulnerability index / as the integral of the normalized network efficiency 
curve minus the baseline. When a certain fraction of edges (denoted as a) are removed, 
the invulnerability index is denoted as I a . As seen in Fig. 1, shaded with vertical blue 
lines represent the index. Formally, l a is defined as equation (JSJ). 



The key to solving equation ([6]) is calculating the integral J a s(r) dr of the 
normalized network efficiency curve. In fact, here the integral Jq s(r) dr is the area 
bounded by the normalized network efficiency curve and the coordinate axes. 

Because E is the number of edges, the discrete normalized network efficiency 
function s(r) has E + 1 points { Sq, s\, se}, including the initial graph without any 
attacks. Therefore, the area A a bounded by the normalized network efficiency curve 
and the coordinate axes satisfies equation (fl2~j) when a percentage a of the edges are 
removed as follows: 




(5) 



Equation (jSJ can be rewritten as equation (EJ). 




(6) 




(7) 



where 



e = 



\a*E] 



(8) 
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Thus, equation can be rewritten as equation. 

a 2 

I a = A a + (—- a) (9) 

Because the domain of the coordinates is [0,1], so < A a < 1. Therefore, 
I a E [-0.5,0.5]. 

Thus, the invulnerability index can be quantified. In extreme cases, the network 
efficiency curve may completely overlap the baseline, I a = 0. When the area above the 
baseline bounded by a fraction of points, a, is smaller than the area under the baseline, 
I a < 0, the network is fragile; otherwise, I a > 0, and the network is robust. With the 
same fraction of removed edges, a larger invulnerability index indicates a more robust 
network. 

In particular, when all of the edges are removed from a network, the invulnerability 
index is represented by equation (TT0|) , and the index Ii is often used by researchers to 
estimate network resilience. 

I x = j (sir) -l + r)dr (10) 
Jo 

When a = 1, we can calculate the invulnerability index using a simple method. 
According to geometry, the area bounded by the baseline and the coordinate axes is 
always a constant 1/2, therefore, I\ is the area bounded by the network efficiency curve 
and the coordinate axes minus the constant 1/2, which is written as equation (lllj) . 

h=M-\ (11) 

4 1 = ( ££±£i + £i±il + ... + £ti±££)* ; i (12) 



where 



Note that So = 1 and se = 0; thus, equation ( TTTj) can be rewritten as equation ( fl3l) . 

h = A, - \ (13) 
where A\ can be rewritten as equation ( fHl) : 

E 

i=o 2 h 

The invulnerability index 1\ only reflects the final status of the robustness or 
fragility. Under many circumstances, we need to know the status of a network under 
specific attacks. Therefore, this paper suggests defining a few commonly used network 
invulnerability indexes- 7o.2, h.s, h.7 an d h.o, which correspond to the removal of 20%, 
50%, 70% and 100% of the edges, respectively, under given attack strategies. 
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3. Simulation experiments and results 

To verify the proposed method, we select three commonly used small-world 
networks as test beds: the neural network of the worm Caenorhabditis elegans (C. 
elegans), the power grid of the western United States (Powergrid) and the WS modelfTB"] 
network. The first two are real-world networks, and the WS model network is derived 
from the model of Watts and Strogatz. In the WS model, the number of local connections 
can be defined as r = 10 and the rewiring probability as p = 0.02. Some properties of 
the tested networks are listed in Table 1. 

Table 1. Properties of tested networks. N is the number of nodes, E is the number 
of edges, D is the average degree. 



Network 


N 


E 


D 


C. elegans 


453 


2040 


8.94 


Powergrid 


4941 


6594 


2.66 


WS-model 


1000 


10000 


20 



3.1. Simulation experiments and analysis 

Commonly, two classes of attack strategies are widely used [2]. The first class is 
the random attack strategies, i.e., to remove the nodes or the edges arbitrarily from 
a tested network. The seconde class is the selective attack strategies, i.e., to remove 
the most important nodes or the edges one by one. Based on different definitions on 
the importance of the nodes or the edges, different selective attack strategies can be 
used[14j. In most circumstances, the importance used in the previous studies are the 
degree and the betweenness. Moreover, the calculation method of the importance can 
also affect the selective attack strategies, that is, we can only calculate the importance 
when before the attacks, termed as "based on initial graphs", or we can calculate the 
importance after every attack, termed as "based on recalculation". In general, the 
attack strategies widely used are random, ID (Initial Degree), IB (Initial Betweenness), 
RD (Recalculation Degree), RB (Recalculation Betweenness). Because the experimental 
results of two calculation methods are similar, here we only choose the attack strategies 
based on the initial graphs. 

The simulation results of the proposed method under random, ID and IB attacks 
on nodes and edges are shown in Fig. 2 and Fig. 3, respectively, and the corresponding 
invulnerability indexes are shown in Tables 2 and 3. 

3.2. Simulation experiments under edge attacks 

In these experiments, three widely used attack strategies are used, that is , the 
random, ID and IB edge attacks. Here, the degree of an edge [2] is the product of the 
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(a) Random attack (b) ID attack (c) IB attack 

Figure 2. Simulation experiments under edge attacks. s(r) is the normalized network 
efficiency, which is measured by the size of the giant component, and r is the percentage 
of edges removed. The yellow (triangles) curves represent the Powergrid network tested 
under corresponding random, ID and IB attacks; the red (rectangles) curves represent 
the C. elegans network; the green (circles) curves represent the WS model network; 
and the black line is the baseline. The D in ID indicates edge degree, and B in IB 
indicates the edge betweenness. 

degrees of the linked nodes, and the betweenness of an edge [2] is defined as the number 
of shortest paths between pairs of nodes that run along the edge. 

The results of experiments on networks under random and selective attacks are 
shown in Fig. 2, and the corresponding indexes are listed in Table 2. 

Table 2. The invulnerability indexes of networks tested under edge attacks. 



Network 


Strategy 


-^0.2 


^0.5 


Io.7 


h.o 


C. elegans 


Rn 


0.018 


0.120 


0.206 


0.286 




ID 


0.008 


0.070 


0.120 


0.166 




IB 


0.008 


0.070 


0.138 


0.166 


Powergrid 


Rn 


0.008 


-0.152 


-0.302 


-0.390 




ID 


0.0004 


-0.112 


-0.188 


-0.234 




IB 


-0.062 


-0.240 


-0.316 


-0.358 


WS model 


Rn 


0.020 


0.126 


0.246 


0.406 




ID 


0.020 


0.112 


0.210 


0.300 




IB 


0.020 


0.114 


0.194 


0.206 



In Fig. 2(a), most points on the network efficiency curves for the C. elegans (the red- 
rectangle curve) and the WS model network (the green-triangle curve) are clearly located 
above the baseline under random attacks. In contrast, a large proportion of points on the 
network efficiency curve of the Powergrid network (the yellow-circle curve) are below the 
baseline. Similarly, in Fig. 2(b) and (c), the relative locations of the network efficiency 
curves and the baseline do not change, yet the network efficiency curve of the Powergrid 
(yellow-circle curve) deviates greatly from the baseline as compared with that in Fig. 
2(a). Based on the definition of the baseline, we can determine that the C. elegans and 
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WS model networks are robust and the Powergrid network is fragile under the three 
types of attacks. 

Regarding the quantified network invulnerability (Fig. 2), the corresponding 
invulnerability indexes Jq.2, I0.5 an d I0.7 indicate changes in network invulnerability, 
while the invulnerability index I\ is used to estimate network resilience. For the networks 
tested under the random attack strategy, the four invulnerability indexes of the WS 
model and C. elegans networks are positive and increase to 0.406 and 0.286, respectively. 
By contrast, the Powergrid becomes fragile when over 50% of the edges are removed 
because the network's index becomes negative at J0.5, and I\, which is the smallest index 
value, equals —0.234. From Fig. 2 and Table 2, we can see that the WS model network 
is the most robust among those tested, and the Powergrid is the most fragile. 

Similarly, under ID and IB attacks, the C. elegans and WS model networks are 
robust: the WS model network is the most robust, and the Powergrid is the most 
fragile. 

3.3. Simulation experiments under node attacks 

The simulation results and the four corresponding invulnerability indexes are shown 
in Fig. 3 and Table 3, respectively. 

Similar to the edge attack strategy, the random attack strategy on nodes involves 
removing nodes arbitrarily from the tested networks. In addition, selective attack 
strategies aim to remove nodes in the descending order of node degree or node 
betweenness based on the initial graphs. The degree of a node is the number of 
connections that it has to other nodes, and the betweenness of a node is defined as 
the number of shortest paths from all vertices to all others that pass through that node. 




(a) Random attack (b) ID attack (c) IB attack 

Figure 3. Simulation experiments under node attacks. The degree and betweenness 
are actually the node degree and the node betweenness, respectively. 

The relative locations between the network efficiency curves and the baseline in 
Fig. 3 are similar to those shown in Fig. 2, i.e., most of the points on the network 
efficiency curve of the WS model and C. elegans networks are above the baseline, and 
almost all of the points on the network efficiency curve of the Powergrid are below the 
baseline. However, compared with those shown in Fig. 2, the network efficiency curves 
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Table 3. The invulnerability indexes of networks tested under edge attacks. 



Network 


Strategy 


Iq.2 


h.5 


h.7 


h.o 


C.elegans 


Rn 


0.146 


0.178 


0.180 


0.194 





ID 


0.095 


0.111 


0.128 


0.135 




IB 


0.103 


0.137 


0.154 


0.188 


Powergrid 


Rn 


-0.005 


-0.083 


-0.115 


-0.128 




ID 


0.0007 


-0.094 


-0.134 


-0.156 




IB 


0.116 


-0.076 


-0.156 


-0.201 


WS model 


Rn 


0.143 


0.159 


0.193 


0.207 




ID 


0.119 


0.141 


0.189 


0.267 




IB 


0.108 


0.130 


0.170 


0.265 



drop much more rapidly because the removal of a node indicates the removal a bundle 
of edges to which the node is connected. 

To be specific, under random attacks on the edges (Fig. 3(a)), the network 
invulnerabilities of the C. elegans and WS model networks satisfy the condition I\ > 0, 
and the invulnerability of the latter is slightly larger than that of the former. The value 
of I\ for the Powergrid is negative and is clearly the smallest I\ value. From Fig. 3 and 
Table 3, we can see that the WS model network is the most robust among the networks 
tested under random attacks, and the Powergrid is the most fragile. 

In addition, in Fig. 3(b) and (c), the invulnerability index l\ indicates the order 
of network robustness. Thus, the WS model network is the most robust under ID and 
IB attacks, and the Powergrid is the most fragile. From Table 3, we can see that the 
invulnerability index of the Powergrid becomes negative when a = 0.5. This indicates 
that the network efficiency of the Powergrid decreases more rapidly when over 50% of 
the network's edges are removed. 

From the simulation experiments discussed above, we can draw a few conclusions: 1) 
The proposed method can simultaneously estimate network invulnerability under both 
edge attacks and node attacks; 2) small-world networks can be robust under selective 
attacks; 3) the robustness or fragility of a specific network is consistent under node 
attacks and edge attacks. 

4. Conclusion 

In this paper, we proposed an invulnerability index and demonstrated a calculation 
method to quantify network robustness. Using small-world networks as test beds, the 
validity of the proposed index was evaluated. The experimental results indicated that 
the proposed index could work under both node and edge attacks. Moreover, the 
invulnerability index under node attacks approximates that under edge attacks. In 
other words, the distinctions between the robustness and the fragility of the tested 
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networks were consistent. Moreover, the method provided an accurate demarcation by 
which to distinguish the robustness from the fragility. When addressing the behavior of 
network efficiency under a series of attacks, the method also provided the critical point 
at which a network becomes fragile. The experimental results indicate that two of the 
tested networks are robust to selective and random attacks, whereas the third is fragile: 
i.e., small- world networks can be robust to selective attacks. 

In the future, we will use the method reported here to investigate the robustness of 
other network types to different attack strategies such as PageRank[16j [TTj attacks and 
HITS [IE] attacks. 
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